Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks

Abstract

The quality of text-to-speech voices built from noisy recordings is diminished. To improve it, we propose using a recurrent neural network to enhance acoustic parameters prior to training. We trained a deep recurrent neural network on a parallel database of noisy and clean acoustic parameters, used as the input and output of the network respectively. The database comprised multiple speakers and diverse noise conditions. We also investigated using text-derived features as an additional input to the network. We processed a noisy database of two other speakers with this network and used its output to train an HMM-based acoustic model for text-to-speech synthesis for each voice. Listening experiment results showed that the voice built with enhanced parameters was ranked significantly higher than the voices trained with noisy speech and with speech enhanced by a conventional enhancement system. The text-derived features improved results only for the female voice, where it was ranked as highly as a voice trained with clean speech.
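The mapping described in the abstract, from noisy acoustic frames (optionally concatenated with text-derived features) to clean acoustic frames, can be illustrated with a minimal sketch. This is not the authors' network: it is a single untrained Elman-style recurrent layer in plain Python, and every name, dimension, and value below (`make_input`, `TinyRNN`, `mse`, the toy frames) is a hypothetical chosen for illustration only.

```python
import math
import random


def make_input(noisy_frame, text_feats=None):
    """Concatenate one noisy acoustic frame with optional text-derived features."""
    return list(noisy_frame) + (list(text_feats) if text_feats else [])


class TinyRNN:
    """One Elman recurrent layer plus a linear output layer (illustrative only)."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)

        def mat(rows, cols):
            # Small random weights; a real system would learn these from data.
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                    for _ in range(rows)]

        self.w_in = mat(n_hidden, n_in)       # input -> hidden
        self.w_rec = mat(n_hidden, n_hidden)  # hidden -> hidden (recurrence)
        self.w_out = mat(n_out, n_hidden)     # hidden -> enhanced frame
        self.n_hidden = n_hidden

    def forward(self, frames):
        """Map a sequence of input frames to a sequence of output frames."""
        h = [0.0] * self.n_hidden
        outputs = []
        for x in frames:
            # The new hidden state depends on the current frame and previous state.
            h = [math.tanh(sum(w * xi for w, xi in zip(row_in, x)) +
                           sum(w * hi for w, hi in zip(row_rec, h)))
                 for row_in, row_rec in zip(self.w_in, self.w_rec)]
            outputs.append([sum(w * hi for w, hi in zip(row, h))
                            for row in self.w_out])
        return outputs


def mse(pred, target):
    """Mean squared error between predicted and clean frame sequences."""
    total = sum((p - t) ** 2
                for pf, tf in zip(pred, target) for p, t in zip(pf, tf))
    count = sum(len(tf) for tf in target)
    return total / count


# Toy parallel data: two frames of 3 acoustic parameters, 2 text features each.
noisy = [[0.5, -0.2, 0.1], [0.4, 0.0, 0.3]]
clean = [[0.3, -0.1, 0.0], [0.2, 0.1, 0.2]]
text = [[1.0, 0.0], [0.0, 1.0]]

inputs = [make_input(n, t) for n, t in zip(noisy, text)]
model = TinyRNN(n_in=5, n_hidden=4, n_out=3)
enhanced = model.forward(inputs)
print("MSE against clean frames (untrained):", mse(enhanced, clean))
```

In a trained system the weights would be fitted by minimising an error of this kind over the whole parallel database, and the enhanced frames would then replace the noisy ones when building the synthesis voice.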

Bibliographic details

Authors: Valentini-Botinhao, Cassia; Wang, Xin; Takaki, Shinji; Yamagishi, Junichi
Year: 2016
Original format: PDF
Language: English

Similar literature

1. A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks [J]. Li B., Sim K.C. IEEE Transactions on Audio, Speech, and Language Processing. 2014, no. 8
2. Prosody modeling for syllable based text-to-speech synthesis using feedforward neural networks [J]. Reddy V. Ramu, Rao K. Sreenivasa. Neurocomputing. 2016, no. JANa1
3. Two-stage intonation modeling using feedforward neural networks for syllable based text-to-speech synthesis [J]. V. Ramu Reddy, K. Sreenivasa Rao. Computer Speech and Language. 2013, no. 5
4. Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks [C]. Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki. Annual Conference of the International Speech Communication Association. 2016
5. Engineering Recurrent Neural Networks for Low-Rank and Noise-Robust Computation [D]. Stock, Christopher Hopkins. 2021
6. Evaluation of Mixed Deep Neural Networks for Reverberant Speech Enhancement [O]. Michelle Gutiérrez-Muñoz, Astryd González-Salazar, Marvin Coto-Jiménez. 2020
7. Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR [O]. Weninger, Felix; Erdoğan, Hakan. 2015
8. Text-To-Speech Phrasing Enhancement System Using Neural Networks [R]. Julig, L. F. 1995
